Nativeness Classification with Suprasegmental Features on the Accent Group Level
نویسندگان
چکیده
We present a novel approach to discriminating native and nonnative utterances based on suprasegmental features extracted at the Accent Group (AG) level. Past studies have shown modeling a set of shared intonation patterns across AGs to be effective in predicting local f0 contour shapes. Here we demonstrate that AG level prosodic features are also effective in nativeness classification. The proposed suprasegmental feature set is very low dimensional, and is derived from f0 and energy contours across the AG, as well as normalized duration of the syllables within each AG. A Random Forest back end classifier is used to combine AG level scores from GMM and Decision Tree models, producing nativeness scores at the utterance level. The proposed prosodic nativeness classifier achieves 83.3% accuracy for 2-AG utterances and 89.1% accuracy for 3-AG utterances, exceeding a baseline Gaussian Supervector system’s performance by more than 10% absolute. The vastly lower dimensionality of the proposed feature set relative to the baseline method suggests the importance of suprasegmental features over traditional spectral cues in contributing to the perceived nativeness of a learner’s language.
منابع مشابه
Computer Assisted Pronunciation Teaching (CAPT) and Pedagogy: Improving EFL learners’ Pronunciation Using Clear Pronunciation 2 Software
This study examined the impact of Clear Pronunciation 2 software on teaching English suprasegmental features, focusing on stress, rhythm and intonation. In particular, the software covers five topics in relation to suprasegmental features including consonant cluster, word stress, connected speech, sentence stress and intonation. Seven Iranian EFL learners participated in this study. The study l...
متن کاملMimicked accents — Do speakers have similar cognitive prototypes?
There are several possible situations in which perpetrators might want to disguise their voices in order to avoid identification and to deflect the search for them to another person or group of individuals. One possible manner that can be used for voice disguise is the adoption of another accent. This paper examines the mimicking of the British-English Swedish accent, that is mimicking of the S...
متن کاملCombining multiple approaches to predict the degree of nativeness
Automatic speaker nativeness assessment has multiple applications, such as second language learning and IVR systems. In this paper we view this as a regression problem, since the available labels are on a continuous scale. Multiple approaches were applied, such as phonotactic models, i-vectors, and goodness of pronunciation, covering both segmental and suprasegmental features. Different phonota...
متن کاملThe strength of foreign accent in Czech English under adverse listening conditions
The study connects two major topics in current speech research: foreign accentedness and speech in adverse conditions. We parallel the research in intelligibility of non-native speech, but instead of linguistic unit recognition we focus on the perception of the foreign accent strength. First, the question of type and degree of perceptual deficiencies occurring along with certain types of signal...
متن کاملLanguage identification and accent variation detection in spoken language recordings
We develop a model for identifying languages and accents in audio recordings. Our Hierarchical-Sequential Nodule Model (HSNM) incorporates both short-distance features (which capture simple linguistic distinctions, e.g. phoneme inventories) and longdistance features (which detect long-distance suprasegmental patterns, e.g. tone and prosody) which help a classifier discriminate intelligently amo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012